A Data Preparation Framework based on a Multidatabase Language

Authors

  • Kai-Uwe Sattler
  • Eike Schallehn
Abstract

Integrating and analyzing data from different sources requires dealing with several problems that result from potential heterogeneities. The activities addressing these problems are called data preparation and are supported by various available tools. However, these tools mostly operate in a batch-like manner and do not support the iterative and exploratory nature of the integration and analysis process. In this work we present a framework for important data preparation tasks based on a multidatabase language. This language offers features for solving common integration and cleaning problems as part of query processing. Combining data preparation mechanisms with multidatabase query facilities permits applying and evaluating different integration and cleaning strategies without explicitly loading and materializing data. The paper introduces the language concepts and discusses their application to individual data preparation tasks.

Similar resources

Integrating of File Systems and Multidatabase Systems Based on Xml

Integration between file systems and multidatabase systems is a necessary approach to support data sharing from distributed and heterogeneous data sources. Based on an analysis of the problem of integrating file systems and multidatabase systems, the paper proposes using XML (eXtensible Markup Language) to integrate file systems into multidatabase systems. A common data model named XIDM, ...

Research and Implementation of an Multidatabase System Based on Corba and Xml

A multidatabase system is an effective approach to implementing data sharing and interoperability among many distributed and heterogeneous data sources. In this paper, a CORBA-based architecture model of a multidatabase system is first introduced. Then, an XML-oriented common data model, named XIDM, is presented. These models conform to the characteristics of multidatabase systems, such as autonom...

Query Decomposition, Optimization and Processing in Multidatabase Systems

One way of achieving interoperability among heterogeneous, federated DBMSs is through a multidatabase system that supports a single common data model and a single global query language on top of different types of existing systems. The global schema of a multidatabase system is the result of a schema integration of the schemas exported from the underlying databases, i.e., local databases. A glo...

Mobile Agent Based Self-Adaptive Join for Wide-Area Distributed Query Processing

Mobile and wireless services are new and emergent areas of research. There are many new and unresolved issues in the area. For example, research is needed in the areas of mobile database, network, architecture, security, privacy, trust, and agent. In this issue, we have two papers that look at mobile agent applications for data retrieval. In the first paper, 'Application of Mobile Agents in Mo...

Extending a Multidatabase Manipulation Language to Resolve Schema and Data Conflicts

The management of Multidatabase Systems (MDBS) is complicated by possible structural and semantic heterogeneity of the member database systems, and the requirements to preserve their local autonomy. Semantic heterogeneity is concerned with the differences in the meaning and interpretation of similar data objects across the different systems. In loosely coupled database federations, static schema ...

Publication date: 2001